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Abstract. We give a combinatorial interpretation of a classical meta-Fibonacci se- 
quence defined by G(n) = n — G(G(n — 1)) with the initial condition G(l) = 1, which 
appears in Hofstadter’s “Godel, Escher, Bach: An Eternal Golden Braid”. The interpre- 
tation is in terms of an infinite labelled tree. We then show a couple of corollaries about 
the behaviour of the sequence G(n) directly from the interpretation. 


1. Introduction. In his famous book Godel, Escher, Bach: An Eternal Golden Braid, 
Douglas Hofstadter introduced his G-sequence [4, p. 137] defined as 

G(n) = n - G(G(n - 1)) ; G(l) = 1. (1) 

This recursion is part of the general family of recursions given by G(n) = n — G{G k {n — 1)) 
with initial condition G(l) = 1. The superscript of k > 1 means a fc-fold composition 
of the function G(n). Recursions of this form, where the argument of the defining terms 
depend on previous values of the recursive function, are called meta-Fibonacci or nested 
recursions. There is knowledge about Hofstadter’s G-sequence in literature nowadays, but 
little is known about the other fc-fold recursions above. 

Let F n denote the Fibonacci numbers, defined by F. n = F„_i + F n _i and F\ = F% = 1. 
Meek and Van Rees [5] showed that if n = F ri — • + F r . is the Zeckendorf representation 1 
of n then G(n) = F ri _i + • • • + F r ._ i. Soon both Granville and Rasson [3], and Downey 

and Griswold [2] showed that G(n) = |_<A + where </> = ^ +1 i s the golden ratio. 

Our result is an interpretation for G(n) in terms of counting labels in an infinite labelled 
tree. After our discovery we learned that the result was known by some in meta-Fibonacci 
circles 2 , but to the best of our knowledge there is no published proof of this combinatorial 
interpretation in literature. The purpose of this paper is not only to give a proof of the 
interpretation, but also to provide motivation for adapting this approach in order to find a 
combinatorial interpretation of the related /c-fold recursion above. 


2000 Mathematics Subject Classification. Primary: 05A15; Secondary: 11B37, 11B39. 

Key words and phrases, met a- Fibonacci recursion, Hofstadter’s G— sequence. 

1 Given any positive integer n, it is possible to write n uniquely as n = F ri +• • • + F r . where > r^_ |_i +2 

for 1 < i < j — 1. This is called the Zeckendorf representation of n. 

2 In private communication with Steve Tanny, Hofstadter recalled having prior knowledge of this 
interpretation. 
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2. Combinatorial interpretation of the recursion G(n). Throughout this section we 
will refer to Hofstadter’s G-sequence as defined in (1) as G(n). Table 1 contains the first 20 
values of G(n). Notice from this table that the difference between a term of G(n) from the 
previous term is always 0 or 1. Such a sequence of positive integers is called slow-growing. 
It is not difficult to show via induction that G(n) remains slow-growing for all n. More 
importantly, let /(n) = #G“ 1 ({n}) for n > 1, which we call the frequency sequence of 
G{n). The frequency sequence displays many patterns (see Table 2), for example, it consists 
only of Is and 2s. In fact, let us interpret the frequency sequence as an infinite word W = 
n~i/(n) where the product stands for concatenation. Setting w\ = 2,W2 = l,n ; 3 = 2, 
and w n = w n -\w n -2 for n > 4, we can experimentally verify that the initial segments of 
the word W factorizes as W = WxW2W^w\ ■ ■ ■ w^. Such observations about the frequency 
sequence motivate us to define a labelled tree such that for each n, G(n) is the label of the 
parent of vertex n + 1. 


Table 1. First 20 terms of G(n) 


n 

1 


4 

4 

5 


7 


AT 

IT) 

11 


lo 


15 

16 

IT 

18 

19 


~G^Y 

1 

1 

2 

3 

3 

4 

4 

5 

6 

6 

7 

8 

8 

9 

9 

10 

11 

11 

12 

12 


Table 2. First 20 terms of the frequency sequence f(n) 


n 

1 


3 

4 

NT 


7 


AT 

10 

11 

12 

13 

iA 

15^ 

16 

IT 

18 



fin) 

2 

1 

2 

2 

1 

2 

1 

2 

2 

1 

2 

2 

1 

2 

1 

2 

2 

1 

2 

1 


This tree, denoted as Q, is defined recursively as follows. Q has a root vertex whose left 
subtree is a copy of Q. The root also has a right child which itself has a copy of Q as its only 
subtree. The definition is pictured in the left-half of Figure 1. The labeling procedure works 
by denoting the root of Q as vertex 1, and then labeling the vertices in increasing order of 
their height from the root. All vertices at a specific height are labelled in increasing order 
from right to left. The right half of Figure 1 shows Q labeled up to height three. 



Figure 1. Q is defined on the left. To the right is Q labelled up to height 4. 


Lemma 1 . The tree Q contains F^+2 vertices at height h. Consequently, the number of 
vertices in Q from height 0 through h is Fh+ 4 — 2. 
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Proof. From the definition of Q we note that vertices at height h of G consists of vertices 
at height h — 2 of Q contributed by the right subtree of the root and vertices at height 
h — 1 contributed by the left subtree of the root. As such, the number of vertices at height 
h satisfies the Fibonacci recursion with initial conditions F-i and F$ for heights 0 and 1 
respectively. It follows that there are Fh + 1 vertices at height h and that the number of 
vertices from height 0 to h is + ■ ■ ■ + Fh + 1 = Fh + 4 — 2. We deduce the latter from the 
well-known Fibonacci identity Fi + • • • + F^ = Fh + 2 — 1- See, for example, [1, p. 2] where a 
combinatorial proof is provided. □ 

Remark 1. We could alter the tree G to make it so that G(n) is the label of the parent of 
vertex n. To do this we need to insert a new vertex below the current root of Q and then 
attach it to the root. We then redo the labeling starting from the new vertex. However, 
there are certain advantages with the current definition. First, we think that it makes the 
recursive definition of Q more pleasant. Further, the definition as it stands will make some of 
the computations in the proof of the combinatorial interpretation easier and the exposition 
more clear. 

Theorem 1. Let g{n) denote the label of the parent of vertex n + 1 in the infinite tree Q . 
Then g(n ) satisfies the recursion g(n) = n — g{g(n — 1)) with the initial value g(l) = 1. 

Proof. The proof is by induction on the vertex labels n. By comparing the values of G{n) 
from Table 1 with the values of g{n) from Figure 1, we see that the theorem holds for the 
first 19 vertices, which consists of all vertices of G up to height 4. Now suppose that vertex 
n + 1 is located at height h > 4 and that the theorem holds for all vertices in Q preceding 
n + 1. 

As h > 4, vertex n + 1 is either located on the left subtree of vertex 1 or on the subtree 
emanating from vertex 2. Both these subtrees are a copy of G ■ Denote the subtree containing 
vertex n + 1 as Q' , and the subtree not containing n + 1 as G" ■ If we now remove all the 
labels of G' that it inherits from Q, and relabel Q' in the same manner as we labeled Q 
starting from 1, then vertex n + 1 of G will receive a new label no + 1 on Q' with no < n 
since no + 1 is located at a lower height in Q' . (Specifically, at height h — 1 of Q' if Q' is the 
left subtree of vertex 1 or at height h — 2 of Q' if it is the subtree of vertex 2.) As an example, 
consider n + 1 = 10 in which case Q' is the left subtree of vertex 1 and no + 1 = 5. We 
establish the recursive formula at index n through its validity at index n 0 and by evaluating 
the differences (n + 1) - (n 0 + 1), g{n) - g(n 0 ), and g{g(n - 1)) - g{g(n 0 - 1)). 

The difference (n + 1) — (no + 1) is the number of vertices in Q consisting of vertices 1, 
2, and all the vertices preceding n + 1 that are in Q" . Such vertices preceding n + 1 in Q" 
consist of all vertices from height 0 to h — 2 of Q" . As Q" is isomorphic to Q , there are 
F/j + 2 — 2 vertices from height 0 to h — 2 by Lemma 1. Therefore, 

n + 1 = (no + 1) + 2 + (Fh +2 — 2). (2) 

Now we evaluate g(n ) — g(n 0 ). Notice that the parent of vertex n + 1 in Q is the same 
as the parent of the vertex labelled no + 1 in Q ' , but its label differs between the two trees. 
Vertices 1 or 2 cannot be the parent as n + 1 is above height 4. The difference g(n) — g{no) 
is again given by accounting for vertices 1 and 2 in Q along with the vertices in Q preceding 
the parent of n + 1 and residing in Q" . Since the parent of vertex n + 1 is at height h — 1 
of G, the vertices in Q" that precede the parent of n + 1 range from height 0 to h — 3 of 
G" . Counting as before we deduce that g(n ) — g(no) = Fh+i- Thus, since n = no + Fh + 2 
from (2), we conclude that 


n - g(n) = n 0 - g{n 0 ) + F h . 


(3) 
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We now compare 5(5(71. — 1)) with 5(5(71.0 — 1)). Except for some instances, we show that 
vertex g(g(n — 1)) in G is the same as the one labeled 5(5(770 — 1) in the separate labeling of 
G' ■ Let us assume that vertex n+ 1, considered as a vertex of Q situated in the subtree G', 
is not the first or last vertex at its height in G' ■ In that case, vertex n lies in G ' and points 
to the same vertex as the one labelled no in the separate labeling of G' ■ As such g(n — 1) 
points to the parent vertex of no in G' , which is labelled g(no — 1) in the separate labeling 
of G' ■ This parent cannot be the last vertex at its height in G', for otherwise, it would be 
the common parent of vertices no and no + 1 in G' located at the height above. This would 
make no + 1 the last vertex at its height in G ' , contradicting the assumption that vertex 
n + 1 in G, pointing to the same vertex as the one separately labeled no + 1 in G' , is not so. 

As vertex g(n — 1) of G is not the last vertex at its height in the subtree G ' , vertex 
g(n — 1) + 1 of G lies in the subtree G' and points to the same vertex as the one separately 
labelled 5(770 + 1) in G' ■ Hence the parent of vertex g(n — 1) + 1 in Q, lying at height ft — 2 
of G , also lies in G' and points to vertex 5(5(710 — 1)) in the separate labeling of G' (recall 
that ft > 4 so vertex 5(5(710 — 1)) is defined in G')- This confirms that vertex 5(5(77 — 1)) in 
G is the same as the vertex labeled 5(5(710 — 1)) in G' ■ 

The difference 5(5(71 — 1)) — 5(5(77 o — 1)) in labels once again comes from vertices 1, 2 
and all vertices in G preceding 5(5(71 — 1)) that lie on Q" . Since vertex 5(5(77 — 1)) is at 
height ft — 2 of G, the vertices preceding 5(5(77 — 1)) that lie on Q" range from height 0 to 
ft. — 4 of that subtree. There are Fh — 2 of them by Lemma 1 and thus 

g(g(n - 1)) = g(g{no - 1)) + 2 + (F h - 2) = 5(5(710 - 1)) + F h . 

By the induction hypothesis n o — 5(710) = 5(5(710 — 1)), and so by (3) we get 
n - 5(77) =n 0 - g(n 0 ) + F h = g{g{n 0 - 1)) + F h = 5(5(77 - 1)) . 

We now deal with the exceptional cases, assuming first that vertex n + 1 in G is located 
on the left subtree of vertex 1 and at heigh ft > 4. Deviating from the previous notation a 
bit, denote the left subtree containing n + 1 as Gi and the subtree emanating from vertex 
2 as Gr- Vertex n + 1 is at height ft — 1 of Gi- If it is the first vertex at height ft . — 1 of Gi 
then vertex n is the last vertex at height ft — 2 of Gr- So g(n — 1) is the last vertex at height 
ft — 3 of G r and so 5(71 — 1) + 1 is the first vertex at height h — 2 of Gi- Vertex 5(5(71. — 1)) 
is then the first one at height h — 3 of Gi ■ 

Meanwhile, when Gi is labeled separately as before, vertex n + 1 in G will again point 
to the same vertex as some vertex labeled no + 1 in Gi- Then the vertex labeled n 0 in Gi 
becomes the last vertex at height h — 2 of Gi, which implies that g(no — 1) is the last vertex 
at height h — 3 of Gi, and g(n 0 — 1) + 1 is thus the first vertex of Gi at height h — 2. So 
vertex 5(5(710 — 1)) in Gi, the parent of g{no — 1) + 1, is the first vertex at height h — 3 
of Gi- It points to the same vertex as the one labelled g(g(n — 1)) in G ■ The difference 
g(g(n — 1)) — g{g{no — 1)) is thus Fh due to the same reasons as in the previous case. Thus 
from the induction hypothesis for no, we deduce as before that n — g(n) = g{gin — 1)). 

When vertex n + 1 in G is the last vertex at height at height h— 1 of Gi , vertex g{n — 1) 
is the common parent of vertices n and n + 1, situated as the last vertex at height h — 2 of 
Gi ■ So g{n — 1) + 1 is the first vertex at height h — 2 of Gr making g(g(n — 1)) the first vertex 
at height h — 3 of Gr- On the other hand, with n 0 + 1 defined as before, vertex g{g{no — 1)) 
in Gi will be the first one at height ft, — 2 of Gi- Both vertices g(g{n — 1)) and 5(5(770 — 1)) 
are located at height ft. — 1 of G- 

One must be careful about calculating 5(5(77— 1)) — 5(5(770 — 1)) because vertex 5(5(71— 1)) 
of G occurs before the vertex pointing to 5(5(710 — 1)) in G- Although the label of the 
latter vertex on Gi, which is 5(5(770 — 1)), is less than 5(5(77 — 1)). In this situation we 
simply note that since vertex 5(5(770 — 1)) is the first one at height ft — 2 of Gi, its label is 
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F 2 H + F h _ i + 1 = F h+ 1 — 1. Similarly, as g(c/(n — 1)) is the first vertex at height h— 1 

of Q, its label is F ^ +2 - 1. Therefore g{g{n - 1)) - g{g(n 0 - 1)) = F h+ 2 - F ft+ i = FV As 
before it follows from the induction hypothesis on no that n — g(n) = g{g{n — 1 )). 

When vertex n + 1 is located on the subtree Q r , the exceptional cases are analogous to 
the previous ones. The case for vertex n + 1 being the last one at height h — 2 of Q r is 
analogous to the case when n+ 1 is the first vertex at height h— 1 of Qi. When vertex n+1 
is the first vertex at height h — 2 of Q r the situation is analogous to when n + 1 is the last 
vertex at height h— 1 of Qi . This completes our induction and the proof of the theorem. □ 

We now present a corollary which is the simplest case of Meek and Van Rees’s result on 
how G(n) acts on positive integers when written in their Zeckendorf representation. Recall 
from the introduction that the authors showed if n = F ri + • • • + F r . is the Zeckendorf 
representation of n then G(n ) = F ri _ i + ••• + F r %. We now show this for n = F r , 
that is, for the Fibonacci numbers. With more work one could get the whole result from 
the combinatorial interpretation but we will not pursue that direction because the proof 
is tedious. Instead we will prove the factorization property of the word II^Li f( n ) that 
we observed earlier experimentally, and which led to our discovery of the combinatorial 
interpretation for G(n). 

Corollary 1. g(F n ) = F n _ i for n > 2, where F n is the n th Fibonacci number. 

Proof. For n > 4, the second vertex at height n — 3 is labeled F 2 + ■ ■ ■ + F n _ 2 + 2 = F n . 
Its parent is the first vertex at height n — 4 with label F n _x — 1. Thus the parent of vertex 
F n + 1 is the second vertex at height n — 4 since the first one has vertices F n — 1 and F n 
as its children. Hence g(F n ) = F„_i. For 2 < n < 4, one can verify the claim from Figure 

1 . □ 

Corollary 2 . Define words w\ = 2, w 2 = 1,103 = 2 = W\ and w n = w n - 2 w n -i for n > 3. 
Let f(n ) denote the frequency sequence of g{n). The infinite word W = IX^i f( n ) factorizes 
asW = W!W 2 n,T= 3 w n- 

Proof. The combinatorial interpretation makes it clear that g(n) is slow-growing, and The- 
orem 1 implies that the frequency sequence of G(n ) from (1) is the same as that of g(n). We 
note that /(n) is the number of children of vertex n in Q. Indeed, if vertex n has children 
labeled from a to b then g(k) = n precisely when k ranges from a + 1 to 6 + 1. Let H+ 
denote the word produced by concatenating from left to right the values of /(n) as n ranges 
in increasing order over the vertices at height h of Q. For h > 1, the vertices at height h 
of Q are arranged by placing the vertices at height h — 2 of g to the right of the vertices 
at height h — 1 of g, following from the recursive definition of g. Thus Wh = Wh- 2 Wh-i 
with Wq = 2 and W\ = 12. We show that Wh = Wh+iWh +2 for h > 0. We have that 
Wi = 12 = W 2 W 3 and W 2 = 212 = W 3 W 4 . Assuming that Wh = u>/ l+ iW/j + 2 for 1 < h < N, 
we get that Wjv = Wn- 2 Wn-i = w n - iw nw nw n +1 = wn+iWn + 2 where the last equality 
uses the recursive definition of w n . The claim follows by induction. Finally, to finish the 
proof we note that W 0 = 2 = W\ and so 

OO OO OO OO OO 

n f( n ) = n Wh = Wa n w/,i = wi n w h +iw h +2 = wxw 2 wi. 

n— 1 h —0 h —1 h —1 h — 3 

□ 

The consequence of Corollary 2 is that it allows us to view the word W as two intertwined 
copies of itself along with the initial seeds w\ and w 2 . More precisely, the above factorization 
shows that W = WxW 2 11 ^= 3 ^ = w 3 W n w n+ 1 = W 3 nr =2 w n+2 = n,T =3 w n- What 
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we hope is that one can find analogous factorizations of the frequency sequences generated 
by the &;-fold recursions that was discussed in the introduction. In this manner one may find 
corresponding infinite trees for the fc-fold recursions and proceed to derive a combinatorial 
interpretation. 
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